Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 77
Filtrar
1.
PLoS One ; 19(4): e0300545, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38558075

RESUMO

Short tandem repeat (STR) variation is an often overlooked source of variation between genomes. STRs comprise about 3% of the human genome and are highly polymorphic. Some cause Mendelian disease, and others affect gene expression. Their contribution to common disease is not well-understood, but recent software tools designed to genotype STRs using short read sequencing data will help address this. Here, we compare software that genotypes common STRs and rarer STR expansions genome-wide, with the aim of applying them to population-scale genomes. By using the Genome-In-A-Bottle (GIAB) consortium and 1000 Genomes Project short-read sequencing data, we compare performance in terms of sequence length, depth, computing resources needed, genotyping accuracy and number of STRs genotyped. To ensure broad applicability of our findings, we also measure genotyping performance against a set of genomes from clinical samples with known STR expansions, and a set of STRs commonly used for forensic identification. We find that HipSTR, ExpansionHunter and GangSTR perform well in genotyping common STRs, including the CODIS 13 core STRs used for forensic analysis. GangSTR and ExpansionHunter outperform HipSTR for genotyping call rate and memory usage. ExpansionHunter denovo (EHdn), STRling and GangSTR outperformed STRetch for detecting expanded STRs, and EHdn and STRling used considerably less processor time compared to GangSTR. Analysis on shared genomic sequence data provided by the GIAB consortium allows future performance comparisons of new software approaches on a common set of data, facilitating comparisons and allowing researchers to choose the best software that fulfils their needs.


Assuntos
Genoma Humano , Repetições de Microssatélites , Humanos , Repetições de Microssatélites/genética , Software , Genômica , Genótipo , Sequenciamento de Nucleotídeos em Larga Escala
2.
ERJ Open Res ; 10(1)2024 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-38375425

RESUMO

Introduction: Idiopathic pulmonary fibrosis (IPF) is a chronic interstitial pneumonia marked by progressive lung fibrosis and a poor prognosis. Recent studies have highlighted the potential role of infection in the pathogenesis of IPF, and a prior association of the HLA-DQB1 gene with idiopathic fibrotic interstitial pneumonia (including IPF) has been reported. Owing to the important role that the human leukocyte antigen (HLA) region plays in the immune response, here we evaluated if HLA genetic variation was associated specifically with IPF risk. Methods: We performed a meta-analysis of associations of the HLA region with IPF risk in individuals of European ancestry from seven independent case-control studies of IPF (comprising 5159 cases and 27 459 controls, including a prior study of fibrotic interstitial pneumonia). Single nucleotide polymorphisms, classical HLA alleles and amino acids were analysed and signals meeting a region-wide association threshold of p<4.5×10-4 and a posterior probability of replication >90% were considered significant. We sought to replicate the previously reported HLA-DQB1 association in the subset of studies independent of the original report. Results: The meta-analysis of all seven studies identified four significant independent single nucleotide polymorphisms associated with IPF risk. However, none met the posterior probability for replication criterion. The HLA-DQB1 association was not replicated in the independent IPF studies. Conclusion: Variation in the HLA region was not consistently associated with risk in studies of IPF. However, this does not preclude the possibility that other genomic regions linked to the immune response may be involved in the aetiology of IPF.

3.
Genes Chromosomes Cancer ; 63(1): e23189, 2024 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-37421230

RESUMO

Malignant pleural mesothelioma (MPM), a rare cancer a long latency period (up to 40 years) between asbestos exposure and disease presentation. The mechanisms coupling asbestos to recurrent somatic alterations are poorly defined. Gene fusions arising through genomic instability may create novel drivers during early MPM evolution. We explored the gene fusions that occurred early in the evolutionary history of the tumor. We conducted multiregional whole exome sequencing (WES) of 106 samples from 20 patients undergoing pleurectomy decortication and identified 24 clonal nonrecurrent gene fusions, three of which were novel (FMO9P-OR2W5, GBA3, and SP9). The number of early gene fusion events detected varied from zero to eight per tumor, and presence of gene fusions was associated with clonal losses involving the Hippo pathway genes and homologous recombination DNA repair genes. Fusions involved known tumor suppressors BAP1, MTAP, and LRP1B, and a clonal oncogenic fusion involving CACNA1D-ERC2, PARD3B-NT5DC2, and STAB2-NT5DC2 fusions were also identified as clonal fusions. Gene fusions events occur early during MPM evolution. Individual fusions are rare as no recurrent truncal fusions event were found. This suggests the importance of early disruption of these pathways in generating genomic rearrangements resulting in potentially oncogenic gene fusions.


Assuntos
Amianto , Neoplasias Pulmonares , Mesotelioma Maligno , Mesotelioma , Humanos , Mesotelioma Maligno/genética , Via de Sinalização Hippo , Neoplasias Pulmonares/genética , Neoplasias Pulmonares/patologia , Mesotelioma/genética , Reparo do DNA/genética , Fusão Gênica
4.
Nat Commun ; 14(1): 6713, 2023 10 23.
Artigo em Inglês | MEDLINE | ID: mdl-37872160

RESUMO

Thyroid hormones play a critical role in regulation of multiple physiological functions and thyroid dysfunction is associated with substantial morbidity. Here, we use electronic health records to undertake a genome-wide association study of thyroid-stimulating hormone (TSH) levels, with a total sample size of 247,107. We identify 158 novel genetic associations, more than doubling the number of known associations with TSH, and implicate 112 putative causal genes, of which 76 are not previously implicated. A polygenic score for TSH is associated with TSH levels in African, South Asian, East Asian, Middle Eastern and admixed American ancestries, and associated with hypothyroidism and other thyroid disease in South Asians. In Europeans, the TSH polygenic score is associated with thyroid disease, including thyroid cancer and age-of-onset of hypothyroidism and hyperthyroidism. We develop pathway-specific genetic risk scores for TSH levels and use these in phenome-wide association studies to identify potential consequences of pathway perturbation. Together, these findings demonstrate the potential utility of genetic associations to inform future therapeutics and risk prediction for thyroid diseases.


Assuntos
Hipertireoidismo , Hipotireoidismo , Doenças da Glândula Tireoide , Humanos , Tireotropina/genética , Estudo de Associação Genômica Ampla , Doenças da Glândula Tireoide/genética , Hipotireoidismo/genética , Hipertireoidismo/genética , Tiroxina
5.
medRxiv ; 2023 Jul 24.
Artigo em Inglês | MEDLINE | ID: mdl-37546732

RESUMO

Introduction: Idiopathic pulmonary fibrosis (IPF) is a chronic interstitial pneumonia marked by progressive lung fibrosis and a poor prognosis. Recent studies have highlighted the potential role of infection in the pathogenesis of IPF and a prior association of the HLA-DQB1 gene with idiopathic fibrotic interstitial pneumonia (including IPF) has been reported. Due to the important role that the Human Leukocyte Antigen (HLA) region plays in the immune response, here we evaluated if HLA genetic variation was associated specifically with IPF risk. Methods: We performed a meta-analysis of associations of the HLA region with IPF risk in individuals of European ancestry from seven independent case-control studies of IPF (comprising a total of 5,159 cases and 27,459 controls, including the prior study of fibrotic interstitial pneumonia). Single nucleotide polymorphisms, classical HLA alleles and amino acids were analysed and signals meeting a region-wide association threshold p<4.5×10-4 and a posterior probability of replication >90% were considered significant. We sought to replicate the previously reported HLA-DQB1 association in the subset of studies independent of the original report. Results: The meta-analysis of all seven studies identified four significant independent single nucleotide polymorphisms associated with IPF risk. However, none met the posterior probability for replication criterion. The HLA-DQB1 association was not replicated in the independent IPF studies. Conclusion: Variation in the HLA region was not consistently associated with risk in studies of IPF. However, this does not preclude the possibility that other genomic regions linked to the immune response may be involved in the aetiology of IPF.

6.
JHEP Rep ; 5(8): 100764, 2023 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-37484212

RESUMO

Background & Aims: Non-alcoholic fatty liver disease (NAFLD) is a complex trait with an estimated prevalence of 25% globally. We aimed to identify the genetic variant underlying a four-generation family with progressive NAFLD leading to cirrhosis, decompensation, and development of hepatocellular carcinoma in the absence of common risk factors such as obesity and type 2 diabetes. Methods: Exome sequencing and genome comparisons were used to identify the likely causal variant. We extensively characterised the clinical phenotype and post-prandial metabolic responses of family members with the identified novel variant in comparison with healthy non-carriers and wild-type patients with NAFLD. Variant-expressing hepatocyte-like cells (HLCs) were derived from human-induced pluripotent stem cells generated from homozygous donor skin fibroblasts and restored to wild-type using CRISPR-Cas9. The phenotype was assessed using imaging, targeted RNA analysis, and molecular expression arrays. Results: We identified a rare causal variant c.1691T>C p.I564T (rs745447480) in MTTP, encoding microsomal triglyceride transfer protein (MTP), associated with progressive NAFLD, unrelated to metabolic syndrome and without characteristic features of abetalipoproteinaemia. HLCs derived from a homozygote donor had significantly lower MTP activity and lower lipoprotein ApoB secretion than wild-type cells, while having similar levels of MTP mRNA and protein. Cytoplasmic triglyceride accumulation in HLCs triggered endoplasmic reticulum stress, secretion of pro-inflammatory mediators, and production of reactive oxygen species. Conclusions: We have identified and characterised a rare causal variant in MTTP, and homozygosity for MTTP p.I564T is associated with progressive NAFLD without any other manifestations of abetalipoproteinaemia. Our findings provide insights into mechanisms driving progressive NAFLD. Impact and Implications: A rare genetic variant in the gene MTTP has been identified as responsible for the development of severe non-alcoholic fatty liver disease in a four-generation family with no typical disease risk factors. A cell line culture created harbouring this variant gene was characterised to understand how this genetic variation leads to a defect in liver cells, which results in accumulation of fat and processes that promote disease. This is now a useful model for studying the disease pathways and to discover new ways to treat common types of fatty liver disease.

7.
Bioinformatics ; 39(4)2023 04 03.
Artigo em Inglês | MEDLINE | ID: mdl-36744935

RESUMO

SUMMARY: DeepPheWAS is an R package for phenome-wide association studies that creates clinically curated composite phenotypes and integrates quantitative phenotypes from primary care data, longitudinal trajectories of quantitative measures, disease progression and drug response phenotypes. Tools are provided for efficient analysis of association with any genetic input, under any genetic model, with optional sex-stratified analysis, and for developing novel phenotypes. AVAILABILITY AND IMPLEMENTATION: The DeepPheWAS R package is freely available under GNU general public licence v3.0 from at https://github.com/Richard-Packer/DeepPheWAS. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Fenômica , Software , Fenótipo
8.
Immunogenetics ; 75(3): 201-206, 2023 06.
Artigo em Inglês | MEDLINE | ID: mdl-36224278

RESUMO

Glycophorins are transmembrane proteins of red blood cells (RBCs), heavily glycosylated on their external-facing surface. In humans, there are four glycophorin proteins, glycophorins A, B, C and D. Glycophorins A and B are encoded by two similar genes GYPA and GYPB, and glycophorin C and glycophorin D are encoded by a single gene, GYPC. The exact function of glycophorins remains unclear. However, given their abundance on the surface of RBCs, it is likely that they serve as a substrate for glycosylation, giving the RBC a negatively charged, complex glycan "coat". GYPB and GYPE (a closely related pseudogene) were generated from GYPA by two duplication events involving a 120-kb genomic segment between 10 and 15 million years ago. Non-allelic homologous recombination between these 120-kb repeats generates a variety of duplication alleles and deletion alleles, which have been systematically catalogued from genomic sequence data. One allele, called DUP4, encodes the Dantu NE blood type and is strongly protective against malaria as it alters the surface tension of the RBC membrane. Glycophorins interact with other infectious pathogens, including viruses, as well as the malarial parasite Plasmodium falciparum, but the role of glycophorin variation in mediating the effects of these pathogens remains underexplored.


Assuntos
Doenças Transmissíveis , Glicoforinas , Humanos , Glicoforinas/genética , Glicoforinas/metabolismo , Eritrócitos/metabolismo , Eritrócitos/parasitologia , Proteínas de Membrana/genética , Variação Genética
9.
BMC Med Genomics ; 15(1): 119, 2022 05 21.
Artigo em Inglês | MEDLINE | ID: mdl-35597955

RESUMO

BACKGROUND: The role of copy number variants (CNVs) in susceptibility to asthma is not well understood. This is, in part, due to the difficulty of accurately measuring CNVs in large enough sample sizes to detect associations. The recent availability of whole-exome sequencing (WES) in large biobank studies provides an unprecedented opportunity to study the role of CNVs in asthma. METHODS: We called common CNVs in 49,953 individuals in the first release of UK Biobank WES using ClinCNV software. CNVs were tested for association with asthma in a stage 1 analysis comprising 7098 asthma cases and 36,578 controls from the first release of sequencing data. Nominally-associated CNVs were then meta-analysed in stage 2 with an additional 17,280 asthma cases and 115,562 controls from the second release of UK Biobank exome sequencing, followed by validation and fine-mapping. RESULTS: Five of 189 CNVs were associated with asthma in stage 2, including a deletion overlapping the HLA-DQA1 and HLA-DQB1 genes, a duplication of CHROMR/PRKRA, deletions within MUC22 and TAP2, and a duplication in FBRSL1. The HLA-DQA1, HLA-DQB1, MUC22 and TAP2 genes all reside within the human leukocyte antigen (HLA) region on chromosome 6. In silico analyses demonstrated that the deletion overlapping HLA-DQA1 and HLA-DQB1 is likely to be an artefact arising from under-mapping of reads from non-reference HLA haplotypes, and that the CHROMR/PRKRA and FBRSL1 duplications represent presence/absence of pseudogenes within the HLA region. Bayesian fine-mapping of the HLA region suggested that there are two independent asthma association signals. The variants with the largest posterior inclusion probability in the two credible sets were an amino acid change in HLA-DQB1 (glutamine to histidine at residue 253) and a multi-allelic amino acid change in HLA-DRB1 (presence/absence of serine, glycine or leucine at residue 11). CONCLUSIONS: At least two independent loci characterised by amino acid changes in the HLA-DQA1, HLA-DQB1 and HLA-DRB1 genes are likely to account for association of SNPs and CNVs in this region with asthma. The high divergence of haplotypes in the HLA can give rise to spurious CNVs, providing an important, cautionary tale for future large-scale analyses of sequencing data.


Assuntos
Asma , Variações do Número de Cópias de DNA , Alelos , Aminoácidos/genética , Asma/genética , Teorema de Bayes , Bancos de Espécimes Biológicos , Exoma , Predisposição Genética para Doença , Antígenos HLA-DQ/genética , Cadeias HLA-DRB1/genética , Haplótipos , Antígenos de Histocompatibilidade Classe I , Humanos , Reino Unido , Sequenciamento do Exoma
10.
iScience ; 25(5): 104189, 2022 May 20.
Artigo em Inglês | MEDLINE | ID: mdl-35494225

RESUMO

Discovering loci under balancing selection in humans can identify loci with alleles that affect response to the environment and disease. Genome variation data have identified the 5' region of the DMBT1 gene as undergoing balancing selection in humans. DMBT1 encodes the pattern-recognition glycoprotein DMBT1, also known as SALSA, gp340, or salivary agglutinin. DMBT1 binds to a variety of pathogens through a tandemly arranged scavenger receptor cysteine-rich (SRCR) domain, with the number of domains polymorphic in humans. We show that the signal of balancing selection is driven by one haplotype usually carrying a shorter SRCR repeat and another usually carrying a longer SRCR repeat. DMBT1 encoded by a shorter SRCR repeat allele does not bind a cariogenic and invasive Streptococcus mutans strain, in contrast to the long SRCR allele that shows binding. Our results suggest that balancing selection at DMBT1 is due to host-microbe interactions of encoded SRCR tandem repeat alleles.

11.
FASEB J ; 36(3): e22200, 2022 03.
Artigo em Inglês | MEDLINE | ID: mdl-35182405

RESUMO

Intelectins (intestinal lectins) are highly conserved across chordate evolution and have been implicated in various human diseases, including Crohn's disease (CD). The human genome encodes two intelectin genes, intelectin-1 (ITLN1) and intelectin-2 (ITLN2). Other than its high sequence similarity with ITLN1, little is known about ITLN2. To address this void in knowledge, we report that ITLN2 exhibits discrete, yet notable differences from ITLN1 in primary structure, including a unique amino terminus, as well as changes in amino acid residues associated with the glycan-binding activity of ITLN1. We identified that ITLN2 is a highly abundant Paneth cell-specific product, which localizes to secretory granules, and is expressed as a multimeric protein in the small intestine. In surgical specimens of ileal CD, ITLN2 mRNA levels were reduced approximately five-fold compared to control specimens. The ileal expression of ITLN2 was unaffected by previously reported disease-associated variants in ITLN2 and CD-associated variants in neighboring ITLN1 as well as NOD2 and ATG16L1. ITLN2 mRNA expression was undetectable in control colon tissue; however, in both ulcerative colitis (UC) and colonic CD, metaplastic Paneth cells were found to express ITLN2. Together, the data reported establish the groundwork for understanding ITLN2 function(s) in the intestine, including its possible role in CD.


Assuntos
Doença de Crohn/metabolismo , Lectinas/metabolismo , Celulas de Paneth/metabolismo , Vesículas Secretórias/metabolismo , Proteínas Relacionadas à Autofagia/genética , Proteínas Relacionadas à Autofagia/metabolismo , Humanos , Lectinas/genética , Proteína Adaptadora de Sinalização NOD2/genética , Proteína Adaptadora de Sinalização NOD2/metabolismo , RNA Mensageiro/genética , RNA Mensageiro/metabolismo
12.
Trends Genet ; 38(1): 45-58, 2022 01.
Artigo em Inglês | MEDLINE | ID: mdl-34284881

RESUMO

Structural variation (SV) is a large difference (typically >100 bp) in the genomic structure of two genomes and includes both copy number variation and variation that does not change copy number of a genomic region, such as an inversion. Improved reference genomes, combined with widespread genome sequencing using short-read sequencing technology, and increasingly using long-read sequencing, have reignited interest in SV. Recent large-scale studies and functional focused analyses have highlighted the role of SV in human evolution. In this review, we highlight human-specific SVs involved in changes in the brain, population-specific SVs that affect response to the environment, including adaptation to diet and infectious diseases, and summarise the contribution of archaic hominin admixture to present-day human SV.


Assuntos
Variações do Número de Cópias de DNA , Hominidae , Animais , Variações do Número de Cópias de DNA/genética , Genoma , Genoma Humano/genética , Variação Estrutural do Genoma/genética , Genômica , Hominidae/genética , Humanos , Análise de Sequência de DNA
14.
Sci Rep ; 11(1): 15548, 2021 07 30.
Artigo em Inglês | MEDLINE | ID: mdl-34330944

RESUMO

Intelectins are a family of multimeric secreted proteins that bind microbe-specific glycans. Both genetic and functional studies have suggested that intelectins have an important role in innate immunity and are involved in the etiology of various human diseases, including inflammatory bowel disease. Experiments investigating the role of intelectins in human disease using mouse models are limited by the fact that there is not a clear one-to-one relationship between intelectin genes in humans and mice, and that the number of intelectin genes varies between different mouse strains. In this study we show by gene sequence and gene expression analysis that human intelectin-1 (ITLN1) has multiple orthologues in mice, including a functional homologue Itln1; however, human intelectin-2 has no such orthologue or homologue. We confirm that all sub-strains of the C57 mouse strain have a large deletion resulting in retention of only one intelectin gene, Itln1. The majority of laboratory strains have a full complement of six intelectin genes, except CAST, SPRET, SKIVE, MOLF and PANCEVO strains, which are derived from different mouse species/subspecies and encode different complements of intelectin genes. In wild mice, intelectin deletions are polymorphic in Mus musculus castaneus and Mus musculus domesticus. Further sequence analysis shows that Itln3 and Itln5 are polymorphic pseudogenes due to premature truncating mutations, and that mouse Itln1 has undergone recent adaptive evolution. Taken together, our study shows extensive diversity in intelectin genes in both laboratory and wild-mice, suggesting a pattern of birth-and-death evolution. In addition, our data provide a foundation for further experimental investigation of the role of intelectins in disease.


Assuntos
Citocinas/genética , Lectinas/genética , Animais , Evolução Molecular , Proteínas Ligadas por GPI/genética , Humanos , Laboratórios , Camundongos , Camundongos Endogâmicos C57BL , Filogenia , RNA Mensageiro/genética
16.
Sci Rep ; 11(1): 12889, 2021 06 18.
Artigo em Inglês | MEDLINE | ID: mdl-34145348

RESUMO

Intelectins are ancient carbohydrate binding proteins, spanning chordate evolution and implicated in multiple human diseases. Previous GWAS have linked SNPs in ITLN1 (also known as omentin) with susceptibility to Crohn's disease (CD); however, analysis of possible functional significance of SNPs at this locus is lacking. Using the Ensembl database, pairwise linkage disequilibrium (LD) analyses indicated that several disease-associated SNPs at the ITLN1 locus, including SNPs in CD244 and Ly9, were in LD. The alleles comprising the risk haplotype are the major alleles in European (67%), but minor alleles in African superpopulations. Neither ITLN1 mRNA nor protein abundance in intestinal tissue, which we confirm as goblet-cell derived, was altered in the CD samples overall nor when samples were analyzed according to genotype. Moreover, the missense variant V109D does not influence ITLN1 glycan binding to the glycan ß-D-galactofuranose or protein-protein oligomerization. Taken together, our data are an important step in defining the role(s) of the CD-risk haplotype by determining that risk is unlikely to be due to changes in ITLN1 carbohydrate recognition, protein oligomerization, or expression levels in intestinal mucosa. Our findings suggest that the relationship between the genomic data and disease arises from changes in CD244 or Ly9 biology, differences in ITLN1 expression in other tissues, or an alteration in ITLN1 interaction with other proteins.


Assuntos
Citocinas/genética , Trato Gastrointestinal/metabolismo , Regulação da Expressão Gênica , Variação Genética , Lectinas/genética , Alelos , Doença de Crohn/genética , Citocinas/química , Suscetibilidade a Doenças , Proteínas Ligadas por GPI/química , Proteínas Ligadas por GPI/genética , Loci Gênicos , Humanos , Lectinas/química , Especificidade de Órgãos/genética
17.
Nat Commun ; 12(1): 1751, 2021 03 19.
Artigo em Inglês | MEDLINE | ID: mdl-33741915

RESUMO

Malignant Pleural Mesothelioma (MPM) is typically diagnosed 20-50 years after exposure to asbestos and evolves along an unknown evolutionary trajectory. To elucidate this path, we conducted multi-regional exome sequencing of 90 tumour samples from 22 MPMs acquired at surgery. Here we show that exomic intratumour heterogeneity varies widely across the cohort. Phylogenetic tree topology ranges from linear to highly branched, reflecting a steep gradient of genomic instability. Using transfer learning, we detect repeated evolution, resolving 5 clusters that are prognostic, with temporally ordered clonal drivers. BAP1/-3p21 and FBXW7/-chr4 events are always early clonal. In contrast, NF2/-22q events, leading to Hippo pathway inactivation are predominantly late clonal, positively selected, and when subclonal, exhibit parallel evolution indicating an evolutionary constraint. Very late somatic alteration of NF2/22q occurred in one patient 12 years after surgery. Clonal architecture and evolutionary clusters dictate MPM inflammation and immune evasion. These results reveal potentially drugable evolutionary bottlenecking in MPM, and an impact of clonal architecture on shaping the immune landscape, with potential to dictate the clinical response to immune checkpoint inhibition.


Assuntos
Deleção Cromossômica , Neoplasias Pulmonares/genética , Mesotelioma/genética , Mutação , Neoplasias Pleurais/genética , Proteínas Supressoras de Tumor/genética , Células Clonais/metabolismo , Células Clonais/patologia , Análise por Conglomerados , Estudos de Coortes , Humanos , Estimativa de Kaplan-Meier , Prognóstico , Microambiente Tumoral/genética , Proteínas Supressoras de Tumor/classificação , Sequenciamento do Exoma/métodos
18.
Ann Hum Genet ; 85(1): 7-17, 2021 01.
Artigo em Inglês | MEDLINE | ID: mdl-32895931

RESUMO

Structural variation in the human genome can affect risk of disease. An example is a complex structural variant of the human glycophorin gene cluster, called DUP4, which is associated with a clinically significant level of protection against severe malaria. The human glycophorin gene cluster harbours at least 23 distinct structural variants, and accurate genotyping of this complex structural variation remains a challenge. Here, we use a polymerase chain reaction-based strategy to genotype structural variation at the human glycophorin gene cluster, including the alleles responsible for the U- blood group. We validate our approach, based on a triplex paralogue ratio test, on publically available samples from the 1000 Genomes project. We then genotype 574 individuals from a longitudinal birth cohort (Tori-Bossito cohort) using small amounts of DNA at low cost. Our approach readily identifies known deletions and duplications, and can potentially identify novel variants for further analysis. It will allow exploration of genetic variation at the glycophorin locus, and investigation of its relationship with malaria, in large sample sets at minimal cost, using standard molecular biology equipment.


Assuntos
Técnicas de Genotipagem , Glicoforinas/genética , Malária/genética , Benin , Genoma Humano , Genótipo , Humanos , Família Multigênica , Reação em Cadeia da Polimerase
19.
BMC Genomics ; 21(1): 446, 2020 Jun 29.
Artigo em Inglês | MEDLINE | ID: mdl-32600246

RESUMO

BACKGROUND: Approximately 5% of the human genome shows common structural variation, which is enriched for genes involved in the immune response and cell-cell interactions. A well-established region of extensive structural variation is the glycophorin gene cluster, comprising three tandemly-repeated regions about 120 kb in length and carrying the highly homologous genes GYPA, GYPB and GYPE. Glycophorin A (encoded by GYPA) and glycophorin B (encoded by GYPB) are glycoproteins present at high levels on the surface of erythrocytes, and they have been suggested to act as decoy receptors for viral pathogens. They are receptors for the invasion of the protist parasite Plasmodium falciparum, a causative agent of malaria. A particular complex structural variant, called DUP4, creates a GYPB-GYPA fusion gene known to confer resistance to malaria. Many other structural variants exist across the glycophorin gene cluster, and they remain poorly characterised. RESULTS: Here, we analyse sequences from 3234 diploid genomes from across the world for structural variation at the glycophorin locus, confirming 15 variants in the 1000 Genomes project cohort, discovering 9 new variants, and characterising a selection of these variants using fibre-FISH and breakpoint mapping at the sequence level. We identify variants predicted to create novel fusion genes and a common inversion duplication variant at appreciable frequencies in West Africans. We show that almost all variants can be explained by non-allelic homologous recombination and by comparing the structural variant breakpoints with recombination hotspot maps, confirm the importance of a particular meiotic recombination hotspot on structural variant formation in this region. CONCLUSIONS: We identify and validate large structural variants in the human glycophorin A-B-E gene cluster which may be associated with different clinical aspects of malaria.


Assuntos
Variação Estrutural do Genoma , Glicoforinas/genética , Malária Falciparum/genética , Pontos de Quebra do Cromossomo , Mapeamento Cromossômico , Bases de Dados Genéticas , Resistência à Doença , Humanos , Hibridização in Situ Fluorescente , Alinhamento de Sequência , Sequenciamento Completo do Genoma
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...